METHOD, APPARATUS AND PROGRAM FOR TEXT IMAGE PROCESSING 

BACKGROUND OF THE INVENTION 
Field of the Invention 
5 The present invention relates to a method and an apparatus 

for carrying out processing on text image data representing a 
text image. The present invention also relates to a program 
that causes a computer to execute the text image processing 
method. 

10 Description of the Related Art 

A system is known wherein image data obtained by an imaging 
device such as a digital camera or by reading images recorded 
on a film with a scanner are reproduced by an output device such 
as a printer or a monitor. When the image data are reproduced, 

15 a quality of a reproduced image can be improved by carrying out 
various kinds of image processing such as density conversion 
processing, white balance processing, gradation conversion 
processing, chroma enhancement processing, and sharpness 
processing on the image data. 

20 Meanwhile, the spread of mobile phones is remarkable, and 

camera-embedded mobile terminals such as camera phones having 
imaging means for obtaining image data by photography (see 
Japanese Unexamined Patent Publications No. 6 (1994) -233020, 
9(1997)-322114, 2000-253290, and US Patent 6,337,712 for 

25 example) are spreading. By using such a camera-embedded mobile 
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terminal, preferable image data obtained by photography can be 
used as wallpaper of a screen of the terminal. Furthermore, 
image data obtained by a user through photography can be sent 
to a mobile terminal such as a mobile phone or a PDA owned by 
5 his/her friend by being attached to an E-mail message. 
Therefore, for example, in the case where a user needs to cancel 
an appointment or the user seems likely to be late for meeting, 
the user can photograph himself /herself with a sorrowful 
expression and can send the photograph to his/her friend. In 
10 this manner, the user can let his/her friend know a situation 
of the user, which is convenient for communication with the 
friend. 

An image server has also been proposed, with use of an 
image processing apparatus for obtaining processed image data 

15 by carrying out various kinds of image processing on image data 
obtained by photography with a camera phone. Such an image 
server can receive image data sent from a camera-embedded mobile 
terminal and sends processed image data obtained by carrying 
out image processing on the image data to a destination 

20 specified by a user using the camera-embedded mobile terminal. 
Furthermore, the image server can store the image data and can 
send the image data to the camera-embedded mobile terminal upon 
a request input from the mobile terminal. By carrying out the 
image processing on the image data in the image server, a 

25 high-quality image can be used as wallpaper for a screen of the 
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mobile terminal and can be sent to friends. 

Meanwhile, in the case where characters written on a 
medium such as paper or blackboard (hereinafter referred to as 
a text medium) are output as information, text data are 
5 generated by typing the characters or text image data are 
generated by photography of the text medium. However, typing 
is a troublesome operation. Moreover, although the characters 
included in the text image data can be read by reproduction of 
the text image data, the characters are not easy to see if image 
10 processing such as white balance processing is carried out on 
the text image data . 

Furthermore, since a size of readable characters is 
limited, characters included in a text image becomes too small 
and are not easy to see if a text medium such as a blackboard 
15 having a large size is photographed. 

SUMMARY OF THE INVENTION 
The present invention has been conceived based on 
consideration of the above circumstances. An object of . the 
present invention is therefore to easily output information of 
20 characters written on a text medium such as paper. 

A text image processing method of the present invention 
comprises the steps of: 

receiving an input of a text image data set representing 
a text image obtained by photography of a text medium on which 
25 characters are written; 
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obtaining a character code data set by converting the 
characters included in the text image into codes through 
character recognition processing on the text image data set; 
and 

outputting the character code data set. 

The character recognition processing refers to an OCR 
technique whereby the character code data set is obtained 
through pattern recognition carried out on the text image. 

In the text image processing method of the present 
invention, the text image data set may be generated as a 
composite of partial text image data sets obtained by partially 
photographing the text medium while dividing the text medium 
into parts. 

In the text image processing method of the present 
invention, the text image data set may be generated as a 
composite of frame image data sets representing predetermined 
frames cut from a moving image data set obtained by filming the 
text medium. 

The predetermined frames refer to frames enabling 
restoration of the text image data set representing the entire 
text image by generating the composite image from the frame 
image data sets. Filming the text medium refers to 
photographing the text medium while moving the portion of the 
text medium which is being photographed. 

In the text image processing method of the present 
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invention, the text image data set may be stored so that link 
information can be output together with the character code data 
set, for representing where the text image data set, from which 
the character code data set was obtained, is stored. 
5 Furthermore, in the text image processing method of the 

present invention, the character code data set may be converted 
into a voice data set so that the voice data set can be output 
instead of or together with the character code data set. 

In the text image processing method of the present 

10 invention, the text image data set obtained by photography of 
the text medium with a camera-embedded mobile terminal may be 
received from the camera-embedded mobile terminal. In this 
case, the character code data set may be sent to the 
camera-embedded mobile terminal. 

15 A text image processing apparatus of the present 

invention comprises : 

input means for receiving an input of a text image data 
set representing a text image obtained by photography of a text 
medium on which characters are written; 

20 character recognition means for obtaining a character 

code data set by converting the characters included in the text 
image into codes through character recognition processing on 
the text image data set; and 

output means for outputting the character code data set. 

25 The text image processing apparatus of the present 
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invention may further comprise composition means for obtaining 
the text image data set through generation of a composite image 
from partial text image data sets obtained by partially 
photographing the text medium while dividing the text medium 
5 into parts. 

Furthermore, the text image processing apparatus of the 
present invention may further comprise cutting means for 
cutting predetermined frames from a moving image data set 
obtained by filming the text medium; and 
10 composition means for obtaining the text image data set 

through generation of a composite image from frame image data 
sets representing the predetermined frames cut by the cutting 
means . 

Moreover, the text image processing apparatus of the 
15 present invention may further comprise storage means for 
storing the text image data set; and 

link information generation means for generating link 
information representing where the text image data set, from 
which the character code data set was obtained, is stored so 
20 that 

the output means can output the link information together 
with the character code data set. 

In addition, the text image processing apparatus of the 
present invention may further comprise voice conversion means 
25 for converting the character code data set into a voice data 
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set so that 

the output means can output the voice data set instead 
of or together with the character code data set. 

The text image processing apparatus of the present 
5 invention may further comprise communication means for 
receiving the text image data set obtained by photography of 
the text medium with a camera-embedded mobile terminal and sent 
from the camera-embedded mobile terminal, and for sending the 
character code data set to the camera-embedded mobile terminal . 

10 The text image processing method of the present invention 

may be provided as a program for causing a computer to execute 
the text image processing method. 

According to the present invention, the input of the text 
image data set is received, and the characters included in the 

15 text image are converted into the character codes by the 
character recognition processing on the text image data set. 
The character code data set obtained in the above manner is then 
output. Therefore, as long as the text image data set is 
obtained with a digital camera or the like by photography of 

20 the characters written on the text medium such as paper or a 
blackboard, the characters written on the text medium can be 
output as text information represented by the character code 
data set, as a result of application of the text image processing 
method of the present invention to the text image data set. 

25 Consequently, the characters written on the text medium can be 
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displayed as text. 

By obtaining the text image data set as the composite of 
the partial text image data sets obtained by photographing each 
of the parts of the text medium, the characters written over 
the entire text medium having a wide area such as a blackboard 
can be obtained as the character code data set. 

Furthermore, if the predetermined frames are cut from the 
moving image data set obtained by filming the text medium and 
if the text image data set is obtained as the composite of the 
frame image data sets representing the predetermined frames, 
the characters written over the entire text medium having a wide 
area such as a blackboard can be obtained as the character code 
data set. 

By outputting the link information representing where the 
text image data set is stored together with the character code 
data set, the text image data set from which the character code 
data set was obtained can be referred to, according to the link 
information. Therefore, the text image represented by the text 
image data set can be compared to the text represented by the 
character code data set. In this manner, whether or not the 
character code data set has errors therein can be confirmed 
easily. 

Moreover, by converting the character code data set into 
the voice data set and by outputting the voice data set instead 
of the character code data set, an illiterate person or a 
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vision-impaired person can understand the content represented 
by the characters written on the text medium. 

If the text image data set is obtained by photography of 
the text medium with a camera-embedded mobile terminal, the text 
5 medium can be easily photographed, and the character code data 
set representing the text image can be obtained from the text 
image data set. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a block diagram showing the configuration of 
10 a text image communication system adopting a text image 
processing apparatus of a first embodiment of the present 
inventions- 
Figure 2 is a flow chart showing procedures carried out 
in the first embodiment; 
15 Figure 3 is a block diagram showing the configuration of 

a text image communication system adopting a text image 
processing apparatus of a second embodiment of the present 
inventions- 
Figure 4 is a flow chart showing procedures carried out 
20 in the second embodiments- 
Figure 5 is a block diagram showing the configuration of 
a text image communication system adopting a text image 
processing apparatus of a third embodiment of the present 
invention; 

25 Figures 6A and 6B are diagrams for explaining generation 
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of partition information; 

Figure 7 is a flow chart showing procedures carried out 
in the third embodiment; 

Figure 8 is a block diagram showing the configuration of 
5 a text image communication system adopting a text image 
processing apparatus of a fourth embodiment of the present 
invention; 

Figures 9A and 9B are diagrams for explaining addition 
of marks; and 

10 Figure 10 is a flow chart showing procedures carried out 

in the fourth embodiment. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Hereinafter^ embodiments of the present invention will 
be explained with reference to the accompanying drawings. 

15 Figure 1 is a block diagram showing the configuration of a text 
image communication system adopting a text image processing 
apparatus of a first embodiment of the present invention. As 
shown in Figure 1, the text image communication system in the 
first embodiment exchanges data between a text image processing 

20 apparatus 2 and a camera-embedded mobile phone 3 (hereinafter 
referred to as a camera phone 3) via a mobile phone communication 
network 4 . 

The text image processing apparatus 2 comprises 
communication means 21, correction means 22, character 
25 recognition means 23, storage means 24, and link information 
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generation means 25. The communication means 21 carries out 
data communication with the camera phone 3 via the mobile phone 
communication network 4 • The correction means 22 obtains a 
corrected text image data set SI by correcting distortion caused 
by aberration of a camera lens or the like of the camera phone 
3 and occurring on a text image represented by a text image data 
set SO sent from the camera phone 3. The character recognition 
means 23 obtains a character code data set TO by coding 
characters included in the text image represented by the 
corrected text image data set SI, through character recognition 
processing on the corrected text image data set SI . The storage 
means 24 stores various kinds of information such as the 
corrected text image data set SI. The link information 
generation means 25 generates link information LO representing 
the URL of the corrected text image data set SI when the corrected 
text image data set SI is stored in the storage means 24. 

The camera phone 3 can send not only the text image data 
set SO but also image data representing people or scenery, for 
example. Therefore, text image information CO is sent from the 
camera phone 3 together with the text image data set SO, to 
represent the fact that the data sent from the camera phone 3 
represents the text image. Therefore, the text image 
processing apparatus 2 can carry out the character recognition 
processing by recognizing the fact that the data sent from the 
camera phone 3 are the text image data set SO, in the case where 
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the data are sent together with the text image information CO. 
The text image information CO includes model information 
regarding the camera phone 3. 

The correction means 22 corrects the distortion occurred 
in the text image due to aberration of the camera lens, for 
example. The storage means 24 has correction information in 
accordance with the model of the camera phone 3. Therefore, 
the correction means 22 obtains the correction information 
corresponding to the mode of the camera phone 3 that obtained 
the text image data set SO, based on the model information of 
the camera phone 3 included in the text image information CO 
sent from the camera phone 3 together with the text image data 
set SO. Based on the correction information, the correction 
means 22 corrects the distortion of the text image represented 
by the text image data set SO, and obtains the corrected text 
image data set SI. 

The character recognition means 23 obtains the character 
code data set TO from the corrected text image data set SI, by 
using an OCR technique for obtaining character codes through 
pattern recognition . 

The character code data set TO is sent from the 
communication means 21 to the camera phone 3 via the mobile phone 
communication network 4, together with the link information LO 
comprising the URL of where the corrected text image data set 
SI is stored. The character code data set TO is displayed as 
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text on the camera phone 3 , 

The camera phone 3 comprises a camera 31 for obtaining 
image data representing a subject by photography of the subject^ 
a liquid crystal display monitor 32 for displaying images and 
5 commands, operation buttons 33 comprising ten keys and the like, 
and a memory 34 for storing various kinds of information. 

A user of the camera phone 3 obtains the text image data 
set SO representing the text image by photography of the 
characters written on a text medium such as paper or a blackboard. 

10 In response to a transfer operation of the buttons 33 by the 
user, the text image data set SO is sent to the text image 
processing apparatus 2 via the mobile phone communication 
network 4. At this time, the text image information CO 
representing the fact that the image data set is the text image 

15 data set is also sent together with the text image data set SO. 

The character code data set TO sent from the text image 
processing apparatus 2 is displayed as the text on the liquid 
crystal display monitor 32. The link information LO is also 
displayed as the URL on the monitor 32. 

20 The operation of the first embodiment will be explained 

next. Figure 2 is a flow chart showing procedures carried out 
in the first embodiment. The user photographs the characters 
written on the text medium such as paper or blackboard by using 
the camera phone 3, and obtains the text image data set SO (Step 

25 SI) . Monitoring is started as to whether or not the user has 
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carried out a transfer instruction operation is then started 
(Step S2) . When a result at Step S2 becomes affirmative, the 
text image data set SO and the text image information CO are 
sent to the text image processing apparatus 2 via the mobile 
5 phone communication network 4 (Step S3) • 

In the text image processing apparatus 2, the 
communication means 21 receives the text image data set SO and 
the text image information CO (Step 84) . The correction means 
22 reads the correction information corresponding to the model 

10 of the camera phone 3 from the storage means 24, and corrects 
the distortion of the text image caused by aberration of the 
camera lens or the like. In this manner, the corrected text 
image data set SI is obtained (Step S5) . The character 
recognition means 23 carries out pattern recognition on the 

15 corrected text image data set SI, and obtains the character code 
data set TO representing the character codes (Step S6) . The 
corrected text image data set SI is stored in the storage means 
24 (Step S7) , and the link information generation means 25 
generates the link information LO having the URL of where the 

20 corrected text image data set SI is stored (Step S8) . The 
character code data set TO and the link information LO are sent 
from the communication means 21 to the camera phone 3 via the 
mobile phone communication network 4 (step S9) . 

In the camera phone 3, the character code data set TO and 

25 the link information LO are received (Step SIO) , and the text 
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represented by the character code data set is displayed on the 
liquid crystal display monitor 32 (Step Sll) . Monitoring is 
started as to whether or not the user carries out a display 
instruction operation regarding the URL represented by the link 
5 information LO by using the buttons 33 (Step S12) . If a result 
at Step S12 is affirmative, the URL represented by the link 
information LO is displayed in the liquid crystal display 
monitor 32 (Step S13) to end the process. 

As has been described above, according to the first 

10 embodiment, the text image processing apparatus 2 carries out 
the character recognition processing on the corrected text 
image data set SI, and the characters included in the text image 
represented by the text image data set SI are coded as the 
character code data set TO. The character code data set TO is 

15 then sent to the camera phone 3. Therefore, as long as the user 
of the camera phone 3 photographs the characters written on the 
text medium such as paper or a blackboard with use of the camera 
phone 3, the characters can be displayed on the liquid crystal 
display monitor 32 as the text, without a typing operation 

20 regarding the characters. When a text image is displayed, 
characters therein may not be easy to see, due to image 
processing carried out thereon. However, since the characters 
can be displayed as the text in this embodiment, the problem 
of hard-to-see characters can be avoided. 

25 By outputting the link information LO of the corrected 
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text image data set SI obtained by correction of the text image 
data set SO from which the character code data set TO was obtained, 
the corrected text image data set SI can be referred to by access 
to the URL represented by the link information LO . Therefore, 
5 the text image represented by the corrected text image data set 
SI can be compared with the text represented by the character 
code data set TO, and presence or absence of an error in the 
character code data set TO can be confirmed easily. 

A second embodiment of the present invention will be 

10 explained next. Figure 3 is a block diagram showing a 
configuration of a text image communication system adopting a 
text image processing apparatus of the second embodiment of the 
present invention. In the second embodiment , the same elements 
as in the first embodiment have the same reference numbers, and 

15 detailed explanations thereof will be omitted. In the second 
embodiment, the text image processing apparatus 2 further 
comprises voice conversion means 27 for converting the 
character code data set TO into a voice data set VO . 

The voice conversion means 27 converts the characters 

20 represented by the character code data set TO into the voice 
data set VO representing a synthetic voice that imitates a human 
voice. The voice (such as a man's or a woman's voice, or the 
voice of a famous person) may be changed by an instruction from 
the camera phone 3 . 

25 The operation of the second embodiment will be explained 
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next. Figure 4 is a flow chart showing procedures carried out 
in the second embodiment. The user photographs the characters 
written on the text medium by using the camera phone 3, and 
obtains the text image data set SO (Step S21) . Monitoring is 
5 started as to whether or not the user has carried out the transfer 
instruction operation (Step S22) . When a result at Step S22 
becomes affirmative, the text image data set SO and the text 
image information CO are sent to the text image processing 
apparatus 2 via the mobile phone communication network 4 (Step 
10 S23) . 

In the text image processing apparatus 2, the 
communication means 21 receives the text image data set SO and 
the text image information CO (Step S24) . The correction means 
22 reads the correction information corresponding to the model 

15 of the camera phone 3 from the storage means 24, and corrects 
the distortion of the text image caused by aberration of the 
camera lens or the like. In this manner, the corrected text 
image data set SI is obtained (Step S25) . The character 
recognition means 23 carries out pattern recognition on the 

20 corrected text image data set SI, and obtains the character code 
data set TO (Step S26) . The voice conversion means 27 converts 
the character code data set TO into the voice data set VO (Step 
S27) . 

The corrected text image data set SI is stored in the 
25 storage means 24 (Step S28) , and the link information generation 
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means 25 generates the link information LO having the URL of 
where the corrected text image data set SI is stored (Step S29) . 
The character code data set TO, the link information LO, and 
the voice data set VO are sent from the communication means 21 
5 to the camera phone 3 via the mobile phone communication network 
4 (step S30) . 

In the camera phone 3, the character code data set TO, 
the link information LO, and the voice data set VO are received 
(Step S31) , and the text represented by the character code data 

10 set TO is displayed on the liquid crystal display monitor 32 
(Step S32) . The voice data set VO is also reproduced as an 
audible voice (Step S33) . Monitoring is started as to whether 
or not the user carries out the display instruction operation 
regarding the URL represented by the link information LO, by 

15 using the buttons 33 (Step S34) . If a result at Step S34 is 
affirmative, the URL represented by the link information LO is 
displayed in the liquid crystal display monitor 32 (Step S35) 
to end the process. 

As has been described above, according to the second 

20 embodiment, the voice data set VO is sent to the camera phone 
3 together with the character code data set TO and the link 
information LO . The text represented by the character code data 
set TO is displayed on the liquid crystal display monitor 32, 
and the voice data set VO is also reproduced. Therefore, the 

25 text displayed on the monitor 32 is read. In this manner, the 
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content of the text image can be understood even if the user 
cannot read the text. 

A third embodiment of the present invention will be 
explained next. Figure 5 is a block diagram showing a 
5 configuration of a text image communication system adopting a 
text image processing apparatus of the third embodiment of the 
present invention. In the third embodiment, the same elements 
as in the first embodiment have the same reference numbers, and 
detailed explanations thereof will be omitted. In the third 

10 embodiment, the user of the camera phone 3 photographs the text 
medium such as paper or blackboard divided into several parts, 
and obtains partial text image data sets DSO. The partial text 
image data sets DSO are sent to the text image processing 
apparatus 2. The partial text image data sets DSO are corrected 

15 and corrected partial text image data sets DSl are then 
generated. The corrected partial text image data sets DSl are 
put together by composition means 28 to generate a text image 
data set S2 as a composite of the corrected partial text image 
data sets DSl. 

20 The camera phone 3 generates partition information DO 

representing how the text image was photographed, and sends the 
partial text image data sets DSO and the partition information 
PO to the text image processing apparatus 2. Figures 6A and 
6B show how the partition information DO is generated. As shown 

25 in Figure 6A, in the case where, the text medium is partitioned 
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into areas A1-'A4 to be photographed, the camera phone 3 adds 
information of the areas from which the partial text image data 
sets DSO are obtained (such as a code like Al) to tag information 
of the partial text image data sets DSO. Meanwhile, as shown 
5 in Figure 6B, the partition information DO represents an image 
that shows an entire area of the text image to be restored and 
the code for specifying each of the partial text image data sets 
DSO to be inserted in the corresponding area of the text image. 
The tag information is also added to the corrected partial text 

10 image data sets DSl obtained by correction of the partial text 
image data sets DSO. 

The composition means 28 refers to the partition 
information DO and the tag information added to the corrected 
partial text image data sets DSl, and obtains the text image 

15 data set S2 representing the text image including the characters 
written on the photographed text medium by putting together the 
corrected partial text image data sets DSl. 

The operation of the third embodiment will be explained 
next. Figure 7 is a flow chart showing procedures carried out 

20 in the third embodiment. The user using the camera phone 3 
photographs the characters written on the text medium by 
dividing the text medium into the areas, and obtains the partial 
text image data sets DSO (Step S41) . Monitoring is started as 
to whether or not the data transfer instruction operation has 

25 been carried out (Step S42) . When a result of the judgment at 
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step S42 becomes affirmative, the partial text image data sets 
DSO, the text image information CO, and the partition 
information DO are sent to the text image processing apparatus 
2 via the mobile phone communication network 4 (Step S43) . 
5 The text image processing apparatus 2 receives the 

partial text image data sets DSO, the text image information 
CO, and the partition information DO by using the communication 
means 21 (Step S44) . The correction means 22 reads the 
correction information corresponding to the model of the camera 

10 phone 3 from the storage means 24, and corrects the distortion 
of the text image caused by aberration of the camera lens or 
the like. In this manner, the corrected partial text image data 
sets DSl are obtained (Step S45) . The composition means 28 puts 
together the corrected partial text image data sets DSl 

15 according to the partition information DO, and obtains the text 
image data set S2 (Step S4 6) . 

The character recognition means 23 carries out pattern 
recognition on the text image data set S2, and obtains the 
character code data set TO representing the character codes 

20 (Step S47) , 

The text image data set S2 is stored in the storage means 
24 (Step S48), and the link information generation means 25 
generates the link information LO representing the URL of where 
the text image data set S2 is stored (Step S49) . The character 

25 code data set TO and the link information LO are then sent from 
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the communication means 21 to the camera phone 3 via the mobile 
phone communication network 4 (Step S50) . 

The camera phone 3 receives the character code data set 
TO and the link information LO (Step S51), and the character 
code data set TO is displayed as text on the liquid crystal 
monitor 32 (step S52) . Monitoring is started as to whether or 
not the instruction for displaying the URL represented by the 
link information LO is input from the buttons 33 (Step S53) . 
If a result at Step S53 is affirmative, the URL is displayed 
on the liquid crystal display monitor 32 (Step S54) to end the 
process . 

As has been described above, according to the third 
embodiment, the text image data set S2 is obtained as the 
composite of the partial text image data sets DSO obtained by 
photography of the text medium divided into the areas, and the 
character code data set TO is obtained by character recognition 
on the text image data set S2 . Therefore, even if the characters 
are written on the text medium having a large area such as a 
blackboard, the characters can be obtained as the character code 
data set TO by partially photographing the text medium divided 
into the areas. 

A fourth embodiment of the present invention will be 
explained next . Figure 8 is a block diagram showing a text image 
communication system adopting a text image processing apparatus 
of the fourth embodiment of the present invention. In the 
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fourth embodiment^ the same elements as in the first embodiment 
have the same reference numbers, and detailed explanations 
thereof will be omitted. In the fourth embodiment, the user 
using the camera phone 3 obtains a moving text image data set 
5 MO by filming the characters written on the text medium, and 
the moving text image data set MO is sent to the text image 
processing apparatus 2 wherein character recognition is carried 
out. Therefore, the text image processing apparatus 2 
comprises cutting means 41 for cutting from the moving text 

10 image data set MO frame data sets DS3 that are necessary for 
generating a composite image representing the text image, and 
composition means 42 for generating a text image data set S3 
by generating the composite image from the frame data sets DS3. 

In the camera phone 3, marks that are necessary for cutting 

15 the frame data sets DS3 are added to the moving text image data 
set MO, and the moving text image data set MO added with the 
marks is sent to the text image processing apparatus 2. Figures 
9A and 9B show how the marks are added. As shown in Figure 9A, 
the text medium is filmed as if the characters such as abcdefg 

20 written thereon are traced. In this manner, the moving text 
image data set MO is obtained. During the photography, when 
a frame F displayed in a finder of the camera phone 3 is 
positioned at the center of each of the areas A1-A4, each of 
the marks is added to the moving text image data MO in response 

25 to an instruction input by the user from the buttons 33. 
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The cutting means 41 cuts the frames added with the marks, 
and generates the frame data sets DS3 that are necessary for 
generating the text image data set S3 as the composite image. 

The composition means 42 generates the composite image 
5 from the frame data sets DS3, and obtains the text image data 
set S3 representing the text image including the characters 
written on the entire text medium. 

The operation of the fourth embodiment will be explained 
next. Figure 10 is a flow chart showing procedures carried out 

10 in the fourth embodiment. The user of the camera phone 3 films 
the characters written on the text medium in the above manner, 
and obtains the moving text image data set MO (Step S61) . 
Monitoring is started as to whether or not the data transmission 
has been instructed (Step S62) . If a result of the judgment 

15 at Step S62 becomes affirmative, the moving text image data set 
MO and the text image information CO are sent to the text image 
processing apparatus 2 via the mobile phone communication 
network 4 (Step S63) . 

The text image processing apparatus 2 receives the moving 

20 text image data set MO and the text image information CO by using 
the communication means 21 (Step S64) . The correction means 
22 reads the correction information corresponding to the model 
of the camera phone 3 from the storage means 24, and corrects 
the distortion of the text image caused by aberration of the 

25 camera lens or the like. In this manner, a corrected moving 
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text image data set Ml is obtained (Step S65) . The cutting means 
41 cuts the frame data sets DS3 from the corrected moving text 
image data set Ml, according to the marks added to the corrected 
moving text image data set Ml (Step S66) . The composition means 
5 42 puts together the frame data sets DS3, and obtains the text 
image data set S3 as the composite thereof (Step S67) . 

The character recognition means 23 carries out pattern 
recognition on the text image data set S3, and obtains the 
character code data set TO representing the character codes 
10 (Step S68) . 

The text image data set S3 is stored in the storage means 
24 (Step S69) , and the link information generation means 25 
generates the link information LO representing the URL of where 
the text image data set S3 is stored (Step S70) . The character 
15 code data set TO and the link information LO are then sent from 
the communication means 21 to the camera phone 3 via the mobile 
phone communication network 4 (Step S71) , 

The camera phone 3 receives the character code data set 
TO and the link information LO (Step S72), and the character 
20 code data set TO is displayed as text on the liquid crystal 
monitor 32 (step S73) . Monitoring is started as to whether or 
not the instruction for displaying the URL represented by the 
link information LO is input from the buttons 33 (Step S74) . 
If a result at Step S53 is affirmative, the URL is displayed 
25 on the liquid crystal display monitor 32 (Step S75) to end the 
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process . 

As has been described above, according to the fourth 
embodiment, the frame data sets DS3 are cut from the moving text 
image data set Ml obtained by filming the text medium, and the 
5 text image data set S3 to be subjected to the character 
recognition is obtained by generating the composite image from 
the frame data sets DS3. Therefore, even if the characters are 
written on the text medium having a large area such as a 
blackboard, the characters can be obtained as the character code 
10 data set TO by filming the text medium. 

In the third and fourth embodiments of the present 
invention, the voice conversion means 27 may be installed in 
the text image processing apparatus 2, as in the second 
embodiment, so that the voice data set VO obtained by conversion 
15 of the character code data set TO can be sent to the camera phone 
3. 

In the first to fourth embodiments described above, in 
the case where the characters are often written by the same 
person, characteristics of handwriting of the person are 

20 preferably stored in the storage means 24. In this case, 
information for identifying the person is also sent to the text 
image processing apparatus 2 together with the text image data 
set SO or the like, and the text image processing apparatus 2 
obtains the character code data set TO by using the character 

25 recognition means 23 in consideration of the characteristics, 
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based on the information. 

By considering the characteristics of the handwriting of 
the person who wrote the characters, accuracy of the character 
recognition by the character recognition means 23 can be 
improved. 

In the first to fourth embodiments described above, the 
camera phone 3 photographs the text medium. However, the text 
medium may be photographed by any camera-embedded mobile 
terminal, such as a camera-embedded PDA and a digital camera 
having a communication function, for generating the text image 
data set. The text image data set is sent to the text image 
processing apparatus 2, and the mobile terminal displays the 
character code data set TO as text. 
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